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Abstract 

A probabilistic cellular automaton (PCA) can be viewed as a Markov chain. The cells 
are updated synchronously and independently, according to a distribution depending on a 
finite neighborhood. We investigate the ergodicity of this Markov chain. A classical cellular 
automaton is a particular case of PCA. For a 1-dimcnsional cellular automaton, we prove 
that ergodicity is equivalent to nilpotency, and is therefore undecidable. We then propose 
\Q an efficient perfect sampling algorithm for the invariant measure of an ergodic PCA. Our 

y—i algorithm does not assume any monotonicity property of the local rule. It is based on a 

bounding process which is shown to be also a PCA. Last, we focus on the PCA Majority, 
whose asymptotic behavior is unknown, and perform numerical experiments using the perfect 
sampling procedure. 
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1 Introduction 

A deterministic cellular automaton (DC A) consists of a lattice (e.g. Z or Z 2 or Z/nZ) divided 
in regular cells, each cell containing a letter of a finite alphabet. The cells evolve synchronously, 
each one evolving in function of a finite number of cells in its neighborhood, according to a local 
rule. 

DCA form a natural mathematical object: by Hedlund's theorem |18] . the mappings realized 
by DCA are precisely the continuous functions (for the product topology) commuting with the 
shift. They also constitute a powerful model of computation, in particular they can "simulate" 
any Turing machine. Last, due to the amazing gap between the simplicity of the definition 
and the intricacy of the generated behaviors, DCA are good candidates for modelling "complex 
systems" appearing in physical and biological processes. 

Probabilistic cellular automata. To take into account random events, one is led to consider 
probabilistic versions of DCA. In one of them, at most one cell is updated at each time, this 
cell being randomly chosen according to a given distribution. For an infinite set of cells, one 
is led to consider continuous time models, obtaining what is known in probability theory as 
an interacting particle system [23]. Another model is that of probabilistic cellular automata 
(PCA) [3T]. For PCA, time is discrete, and all the cells evolve synchronously as for DCA, but 
the difference is that for each cell, the new content is randomly chosen, independently of the 
others, according to a distribution depending only on a finite neighborhood of the cell. 

Let us mention a couple of motivations. First, the investigation of fault-tolerant computational 
models was the motivation for the Russian school to study PCA |31| IT2] . Second, PCA appear 
in combinatorial problems related to the enumeration of directed animals [9] [21 I22J . Third, 
in the context of the classification of DCA (Wolfram's program), robustness to random errors 
can be used as a discriminating criterion |11| 126] . Recently, PCA also proved to be pertinent 
for the density classification problem [10J, that is, testing efficiently if some sequence contains 
more occurrences of or 1. Last, PCA are used in statistical physics and in life sciences. 
They modelize various phenomena, from the dynamical properties of the neural tissue |21j to 
competition between species. 

We focus our study on the equilibrium behavior of PCA. Observe that a PCA may be viewed 
as a Markov chain over the state space A E , where A is the alphabet and E is the set of cells. 
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So the equilibrium is studied via the invariant measures of the Markov chain. Several questions 
are in order. 

Ergodicity. A PCA is ergodic if it has a unique and attractive invariant measure. A challenging 
problem in this area is the positive rates conjecture. A PCA is said to have positive rates if 
for any neighborhood, the updated content of a cell can be any letter with a strictly positive 
probability. The positive rates conjecture states that any one-dimensional (E = Z) model with 
positive rates is ergodic. Gacs exhibited in 2001 a very large and complex counter-example 
in a paper of more than 200 pages [12] which was published with an introductory article by 
Gray |13j . This counter-example is far from being completely understood and several questions 
remain. For instance, does the conjecture hold true for small alphabets and neighborhoods? 
Even for alphabets and neighborhoods of size 2, the question is not settled. 

Performance evaluation. The second natural question is whether the invariant measures can 
be evaluated. A PCA with an alphabet and a neighborhood of size 2 is determined by four 
parameters. If the parameters satisfy a given polynomial equation, there exists an invariant 
measure with an explicit product form |3H Chapter 16]. Under another polynomial condition, 
there exists an invariant measure with an explicit Markovian form, see [3H Chapter 16] or [2]. 
What happens for generic values of the parameters? Or for PCA with a larger neighborhood 
or alphabet? When explicit computation is not possible, simulation becomes the alternative. 
Simulating PCA is known to be a challenging task, costly both in time and space. Also, 
configurations cannot be tracked down one by one (there is an infinite number of them when 
E = Z) and may only be observed through some measured parameters. So the crucial point is 
whether some guarantees can be given upon the results obtained from simulations. 

The contributions of the present paper are as follows: 

First on ergodicity. We prove that the ergodicity of a DCA on Z is undecidable. This was 
mentioned as Unsolved Problem 4-5 in [3D]. Since a DCA is a special case of a PCA, it also 
provides a new proof of the undecidability of the ergodicity of a PCA (Kurdyumov, see |31|, 
Chap. 14], and Toom [29]). 

Second on performance evaluation. Given an ergodic PCA, a perfect sampling procedure is a 
random algorithm which returns a configuration distributed according to the invariant measure. 
By applying the procedure repeatedly, we can estimate the invariant measure with arbitrary 
precision. We propose such an algorithm for PCA by adapting the coupling from the past 
method of Propp & Wilson |24| . When the set of cells is E = Z/nZ, a PCA is a finite state 
space Markov chain. Therefore, coupling from the past from all possible initial configurations 
provides a basic perfect sampling procedure. But a very inefficient one since the number of 
configurations is exponential in n. Here, the contribution consists in simplifying the procedure. 
We define a new PCA on an extended alphabet, called the envelope PCA (EPCA). We obtain 
a perfect sampling procedure for the original PCA by running the EPCA on a single initial 
configuration. When the set of cells is E = Z rf , a PCA is a Markov chain on an uncountable 
state space. So there is no basic perfect sampling procedure anymore. We prove the following: 
If the PCA is ergodic, then the EPCA may or may not be ergodic. If it is ergodic, then we 
can use the EPCA to design an efficient perfect sampling procedure (the result of the algorithm 
is the finite restriction of a configuration with the right invariant distribution). In the case 
E = Z, we give a sufficient condition for the EPCA to be ergodic. The EPCA can be viewed 
as a systematic treatment of ideas already used by Toom for percolation PCA (see for instance 
[301 Section 2]). 

The perfect sampling procedure can also be run on a PCA whose ergodicity is unknown, with 
the purpose of testing it. We illustrate this approach on Majority, prototype of a PCA whose 
equilibrium behavior is not well understood. More precisely, we define a parametrized family 
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of PCA, called Majority(a), a G (0, 1). We prove that for a large enough, the PCA has several 
invariant measures. We conjecture the existence of a phase transition between two situations: 
(i) several invariant measures; (ii) a unique but non-attractive invariant measure. We provide 
some numerical evidence for the phase transition, which would be the first example of this kind. 
In fact, the mere existence of a PCA satisfying (ii) had been a long standing open question 
which was recently answered by the positive [5]. 

Section [2] gives the basic definitions. Section [3] is devoted to the ergodicity problem. Section [4] 
presents the perfect sampling procedures. Last, Section [5] is devoted to the case study of the 
Majority PCA. 

A short version without proofs of the paper appears in the proceedings of the STACS'2011 
Conference [3J. 

2 Probabilistic cellular automata 

Let A be a finite set called the alphabet, and let E be a countable or finite set of cells. We 
denote by X the set A E of configurations. 

We assume that E is equipped with a commutative semigroup structure, whose law is denoted 
by +. In examples, we consider mostly the cases E = Z or E = 7Ljn7L. Given K C E and 
V C E, we define 

V + K = {u + v e E \ u e V,v e K} . 

A cylinder is a subset of X having the form {x G X \ \/k G K, = for a given finite subset 
K of E and a given element (yk)keK G A ■ When there is no possible confusion, we shall 
denote briefly by yx the cylinder {x G X | V7c G K, x^ = y^}. For a given finite subset K, we 
denote by C(K) the set of all cylinders of base K. 

Let us equip X = A E with the product topology, which can be described as the topology 
generated by cylinders. We denote by M.(A) the set of probability measures on A and by 
M.(X) the set of probability measures on X for the cr-algebra generated by all cylinder sets, 
which corresponds to the Borelian cr-algebra. For x G X, denote by 5 X the Dirac measure 
concentrated on the configuration x. 

Definition 2.1 Given a finite set V C E, a transition function of neighborhood V is a func- 
tion f : A v — > M{A). The probabilistic cellular automaton (PCA) P of transition function f 
is the application 

P:M(X) ->M(X) 
H 1 y \iP , 

defined on cylinders by: 

^P{Vk) = K x v+k) J| f((xk+v)vev)(yk) • 

x v+K £C(V+K) k€K 

Let us look at how P acts on a Dirac measure S z . The content of the k-th cell is changed into 
the letter a £ A with probability f((zk+ v )vev)( a ), independently of the evolution of the other 
cells. The real number f ({zk+ v ) V £_v)( a ) G [0, 1] is thus to be thought as the conditional proba- 
bility that, after application of P, the fc-th cell will be in the state a if, before its application, 
the neighborhood of k was in the state (zk+ v ) v ev- 

Let u be the uniform measure on [0, 1]. We define the product measure r = ^i e E u on IP> • 
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Definition 2.2 An update function of the probabilistic cellular automaton P is a deterministic 
function <f> : A E x [0, 1] E — > A (the function (f> takes as argument a configuration and a sample 
in [0, 1] , and returns a new configuration), satisfying for each x G A E , and each cylinder yx, 

r({r G [0, 1} E ; <f>(x, r) G y K }) = \\ /((** 

In practice, it is always possible to define an update function cf) for which the value of (f>(x, r) k 
only depends on (x k+v ) v& y and on r k . For example, if the alphabet is A = {di, . . . , a n }, one 
can set 

at if < r k < f{(x k+v ) v(E v)(ai) 

a-2 if f((x k+v ) veV )(ai) < r k < f((x k+v ) veV )({a 1 ,a 2 }) 



(1) 

. a n if f(.(xk+v)vev({a>i,a>2, ■ ■ ■ ,On-i}) < r k < 1. 



For a given initial configuration x° G and samples (r*)t e N; r * £ [0, 1] E ' , let (a;*)t e pj G (^4 S ) N 
be the sequence defined recursively by 

x t+1 =<j ) ( x \r t ). 

Such a sequence is called a space-time diagram. It can be viewed as a realization of the Markov 
chain. Examples of space-time diagrams appear in Figures [T] and [9| 

Classical cellular automata are a specialization of PCA. 

Definition 2.3 A deterministic cellular automaton (DC A) is a PCA such that for each se- 
quence (x v ) vl =v ^ A v , the measure f((x v ) v& v) is concentrated on a single letter of the alphabet. 
A DC A can thus be seen as a deterministic function F : A E — > A E . 

In the literature, the term cellular automaton denotes what we call here a DCA. Deterministic 
cellular automata have been widely studied, in particular on the set of cells E = Z, see Section[3j 
For a DCA, any initial configuration defines a unique space-time diagram. 

Example 2.4 Let A = {0,1}, E = Z, and V = {0,1}. Consider < e < 1 and the local 
function 

f{x,y) = (1 — e) S x+y mo d2 + £<Wy+lmod2 • 

This defines a PCA that can be considered as a perturbation of the DCA F : A E — > A E defined 
by F(x)i = Xi + mod 2, with errors occurring in each cell independently with probability e. 

Example 2.5 Let A = {0, 1}, E = Z d , and let V be a finite subset of E. Consider < a < 1 
and the local function: 

f((x v ) ve v) = « 5max(x„, veV) + (1 - a) 5 . 

The corresponding PCA is called the percolation PCA associated with V and a. The particular 
case of the space E = Z and the neighborhood V = {0, 1} is called the Stavskaya PCA. In 
Figure [TJ we represent two space-time diagrams of the percolation PCA for V = { — 1,0, 1}. 
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(a) a = 0.5 (b) a = 0.6 

Figure 1: Space-time diagrams of the PCA of Example 2.5 for V = {— 1, 0, 1} 



Invariant measures and ergodicity 

A PCA can be seen as a Markov chain on the state space A E . We use the classical terminology 
for Markov chains that we now recall. 

Definition 2.6 A probability measure it G M(X) is said to be an invariant measure of the 
PCA P if ttP = it. The PCA is ergodic if it has exactly one invariant measure ir which is 
attractive, that is, for any measure \i G Ai(X), the sequence \xP n converges weakly to ir (i.e. 
for any cylinder C, lim n _ s>+00 [iP n (C) = tt(C)). 

A PCA has at least one invariant measure, and the set of invariant measures is convex and 
compact. This is a standard fact, based on the observation that the set Ai(X) of measures on 
X is compact for the weak topology, see for instance |31j . Therefore, there are three possible 
situations for a PCA: 

(i) several invariant measures; 

(ii) a unique invariant measure which is not attractive; 

(Hi) a unique invariant measure which is attractive (ergodic case). 



Example 2.7 Consider the PCA of Example 2.4 Using the results in [31, Chapters 16 and 



17], one can prove that the PCA is ergodic and that its unique invariant measure is the uniform 
mesure, i.e. the product of Bernoulli measures of parameter 1/2. 



Example 2.8 Consider the percolation PCA of Example |2.5| Observe that the Dirac measure 
5 e is an invariant measure. Using a coupling with a percolation model, one can prove the 
following, see for instance [30l Section 2]. There exists a* G (0, 1) such that: 

a < a* (Hi) : ergodicity 

a > a* (i) : several invariant measures. 

The exact value of a* is not known but it satisfies 1/|V| < a* < 53/54. 

The existence of a PCA corresponding to situation (ii) had been a long standing conjecture 
(Unsolved problem 5.7 in Toom |30|). In [5], it is proved that situation (ii) occurs for the 
PCA on {0, 1} Z with neighborhood V = {0,1}, and local function a defined by a(00)(l) = 
1/2, a(01)(l) = 0, a(10)(l) = 1, a(ll)(l) = 1/2. 



The PCA of Example 2.5 exhibits a phase transition between the situations (i) and (Hi). In 
Section [5j we study a PCA that may have a phase transition between the situations (ii) and 
(Hi). It would provide the first example of this type. 
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3 Ergodicity of DCA 



DCA form the simplest class of PCA, it is therefore natural to study the ergodicity of DCA. In 
this section, we prove the undecidability of ergodicity for DCA (Theorem 3.4). This also gives 
a new proof of the undecidability of the ergodicity for PCA. 



Remark. In the context of DCA, the terminology of Definition 2.6 might be confusing. Indeed 
a DCA P can be viewed in two different ways: 

(i) a (degenerated) Markov chain; (ii) a symbolic dynamical system. 

In the dynamical system terminology, P is uniquely ergodic if: [3!/x, \iP = In the Markov 
chain terminology (that we adopt), P is ergodic if: fiP = fj] and [W, vP n — > /j], where 

stands for the weak convergence. Knowing if the unique ergodicity (of symbolic dynamics) 
implies the ergodicity (of the Markovian theory) is an open question for DCA. 

The limit set of P is defined by 

LS= p| P n (A E ) . 

neN 

In words, a configuration belongs to LS if it may occur after an arbitrarily long evolution of 
the cellular automaton. 

Observe that LS is non-empty since it is the decreasing limit of non-empty closed sets. A con- 
structive way to show that LS is non-empty is as follows. The image by P of a monochromatic 
configuration x E is monochromatic: x E — > y E . In particular there exists a monochromatic 
periodic orbit for P, and we have: 

x E — > xf —>■••■—>■ x E _i — > x E ==^ {x E , xf , . . . , x E _i\ C LS . (2) 

Recall that 5 U denotes the probability measure concentrated on the configuration u. The peri- 
odic orbit (xq , . . . ,x|'_ 1 ) provides an invariant measure given by (5 x e + . . . + 5 x b )/k. More 
generally, the support of any invariant measure is included in the limit set. 

Definition 3.1 A DCA is nilpotent if its limit set is a singleton. 



Using ([2]), we see that a DCA is nilpotent iff LS = {x E } for some x 6 A. The following stronger 
statement is proved in [8], using a compactness argument: 

[ P nilpotent ] [ 3x £ A, 3N £ N, P N (A E ) = {x E } } . 

We get next proposition corollary. 

Proposition 3.2 Consider a DCA P. We have: 

[ P nilpotent ] ==^ [ P ergodic ] . 

Proof. Let x £ A and N G N be such that P N (A E ) = {x E }. For any probability measure fi on 
A E , we have [iP N = 5 x e. Therefore, P is ergodic with unique invariant measure 5 x e. □ 

If we restrict ourselves to DCA on Z, we get the converse statement. 
Theorem 3.3 Consider a DCA P on the set of cells Z. We have: 

[ P nilpotent ] <^=^ [ P ergodic ] . 
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Proof. Let P be an ergodic DCA. Assume that there exists a monochromatic periodic orbit 
(xq , . . . , x%_i) with k > 2. Then n = (5 x z + ••• + S x z )/k is the unique invariant measure. The 
sequence S x zP n does not converge weakly to /i, which is a contradiction. Therefore, there exists 
a monochromatic fixed point: P(x z ) = and 5 x z is the unique invariant measure. 

Define the cylinder C = {v E A z \ Vi E K, V{ = x}, where K is some finite subset of Z. For 
any initial configuration u € using the ergodicity of P, we have: 

8 u P n (C) — > ^ Z (C) = 1 . 

But <5 U -P n is a Dirac measure, so 8 u P n (C) is equal to or 1. Consequently, we have 5 u P n (C) = 1 
for n large enough, that is, 

3N € N, Vn > A r , V« G P n (u)i = x . 

In words, in any space-time diagram of P, any column becomes eventually equal to xxx 
Using the terminology of Guillon & Richard [15], the DCA P has a weakly nilpotent trace. It is 
proved in |15j that the weak nilpotency of the trace implies the nilpotency of the DCA. (The 
result is proved for cellular automata on Z and left open in larger dimensions.) This completes 
the proof. □ 

Kari proved in [20] that the nilpotency of a DCA on Z is undecidable. (For DCA on Z rf , d > 2, 



the proof appears in [8].) By coupling Kari's result with Theorem 3.3, we get: 



Corollary 3.4 Consider a DCA P on the set of cells Z. The ergodicity of P in undecidable. 

The undecidability of the ergodicity of a PCA was a known result, proved by Kurdyumov, 
see [31] . see also Toom [29]. Kurdyumov's and Toom's proofs use a non-deterministic PCA of 
dimension 1 and a reduction of the halting problem of a Turing machine. 



Corollary 3.4 is a stronger statement. In fact, the (un) decidability of the ergodicity of a DCA 



was mentioned as Unsolved Problem 4-5 in [30] . We point out that Corollary 3.4 can also be 



obtained without Theorem 3.3 by directly adapting Kari's proof to show the undecidability of 



the ergodicity of the DCA associated with a NW-deterministic tile set. 



4 Sampling the invariant measure of an ergodic PCA 

Generally, the invariant measure(s) of a PCA cannot be described explicitly. Numerical simu- 
lations are consequently very useful to get an idea of the behavior of a PCA. Given an ergodic 
PCA, we propose a perfect sampling algorithm which generates configurations exactly according 
to the invariant measure. 

A perfect sampling procedure for finite Markov chains has been proposed by Propp & Wilson 
[2"4] using a coupling from the past scheme. Perfect sampling procedures have been developed 
since in various contexts. We mention below some works directly linked to the present article. 
For more information see the annotated bibliography: Perfectly Random Sampling with Markov 
Chains, http : / / dimacs . rutgers . edu/~dbwilson/ exact . html/ , 

The complexity of the algorithm depends on the number of all possible initial conditions, which 
is prohibitive for PCA. A first crucial observation already appears in [24] : for a monotone 
Markov chain, one has to consider two trajectories corresponding to minimal and maximal 
states of the system. For anti-monotone systems, an analogous technique has been developed 
by Haggstrom & Nelander [17] that also considers only extremal initial conditions. To cope with 
more general situations, Huber [19] introduced the idea of a bounding chain for determining 
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when coupling has occurred. The construction of these bounding chains is model-dependent 
and in general not straightforward. In the case of a Markov chain on a lattice, Busic et al. [3] 
proposed an algorithm to construct bounding chains. 

Our contribution is to show that the bounding chain ideas can be given in a particularly simple 
and convenient form in the context of PC A via the introduction of the envelope PC A. 



4.1 Basic coupling from the past for PC A 
4.1.1 Finite set of cells 

Consider an ergodic PCA P on the alphabet A and on a finite set of cells E (for example 
Z m = Z/mZ). Let ir be the invariant measure on X = A E . A perfect sampling procedure is a 
random algorithm which returns a state x £ X with probability tt(x). Let us present the Propp 
& Wilson, or coupling from the past (CFTP), perfect sampling procedure. 

Algorithm 1: Basic CFTP algorithm for a finite set of cells 

Data: An update function (j) : X x [0, 1] E — > X of a PCA. A family (f^ n )(fc,n)e-ExN °f i-i-d. 
r.v. with uniform distribution in [0, 1]. 

begin 

t = l ; 
repeat 

R-t = A ; 

for j = —t to —1 do 

I Rj+i = {<l>( x , ( r i)ieE) ; x e Rj} 

end 

t = t + 1 
until |i?o| = lj 

return the unique element of R$ 
end 

The good way to implement this algorithm is to keep track of the partial couplings of trajectories. 
This allows to consider only one-step transitions. 

Proposition 4.1 (|24j) // the procedure stops almost surely, then the PCA is ergodic and the 
output is distributed according to the invariant measure. 

In Figure [2j we illustrate the algorithm on the toy example of a PCA on the alphabet {0, 1} 
and the set of cells Z2. The state space is thus X = {xi = 00, X2 = 01, X3 = 10, X4 = 11}. On 
this sample, the algorithm returns X2- 

A sketch of the proof of Proposition 4.1 can be given using Figure [2j On the last of the four 



pictures, the Markov chain is run from time -4 onwards and its value is X2 at time 0. If we had 
run the Markov chain from time —00 to 0, then the result would obviously still be X2- But if 
we started from time —00, then the Markov chain would have reached equilibrium by time 0. 



4.1.2 Infinite set of cells 



Assume that the set of cells E is infinite. Then a PCA defines a Markov chain on the infinite 
state space X = A E , so the above procedure is not effective anymore. However, it is possible 
to use the locality of the updating rule of a PCA to still define a perfect sampling procedure. 
(This observation already appears in pp.) 
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Let P be an ergodic PCA and denote by ir its invariant distribution. In this context, a perfect 
sampling procedure is a random algorithm taking as input a finite subset K of E and returning 
a cylinder xk £ C(K) with probability tt(xk)- 

To get such a procedure, we use the following fact: if the PCA is run from time —k onwards, 
then to compute the content of the cells in K at time 0, it is enough to consider the cells in the 
finite dependence cone of K. This is illustrated in Figure [3] for the set of cells E = Z and the 
neighborhood V = { — 1,0, 1}, with the choice K = {0}. 
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Figure 3: Dependence cone of a cell 



Observe that the orientation has changed with respect to Figure [2] in order to be consistent 
with the convention of Figures [T] and [9] for space-time diagrams. 

Let us define this more formally. Let V be the neighborhood of the PCA. Given a subset K 
of E, the dependence cone of K is the family (V-t(K))t>o of subsets of E defined recursively 
by V (K) = K and V-t(K) = V + V-t+i(K). Let <f> : X x [0, 1} E -f X be an update function, 
for instance the one defined according to ([TJ. For a given subset K of E, we denote </>_t : 
jy-t(K) x [ Q) \]V- t {K) _^ ^V-t+i(K) the corresponding restriction of 0. 
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With these notations, the algorithm can be written as follows. 



Algorithm 2: Basic CFTP algorithm for an infinite set of cells 

Data: An update function <p '• A x [0, l] E — > X of a PCA. A family (^ n )(fc,n)e_ExN of i.i.d. 
r.v. with uniform distribution in [0, 1]. A finite subset K of E. 

begin 

V (K) = K ; 
t = l ; 
repeat 

V- t (K) = V + V- t+ i(K) ; 
R_ t = A v -W ; 

for j = —t to —1 do 

| R j+1 = {^(x, (i{) ieVj{K) ) ; x G Rj} C /w<«) 
end 

i = i + 1 
until | jF2q| = lj 

return i/ie unique element of R$ 
end 



Next proposition is an easy extension of Proposition 4.1 



Proposition 4.2 If the procedure stops almost surely, then the PCA is ergodic and the output 
is distributed according to the marginal of the invariant measure. 



4.2 Envelope probabilistic cellular automata (EPCA) 

The CFTP algorithm is inefficient when the state space is large. This is the case for PCA: when 
E is finite, the set A E is very large, and when E is infinite, it is the number of configurations 
living in the dependence cone described above which is very large. We cope with this difficulty 
by introducing the envelope PCA. 

To begin with, let us assume that P is a PCA on the alphabet A = {0, 1} (as previously, the 
set of cells is denoted by E, the neighborhood by V C E, and the local function by /). The 
case of a general alphabet is treated in Section |4.5| 



Definition of the EPCA. Let us introduce a new alphabet: 

B = {0,1,7}. 

A word on B is to be thought as a word on A in which the letters corresponding to some 
positions are not known, and are thus replaced by the symbol "?" . Formally we identify B with 
2-^ — as follows: = {0}, 1 = {1}, and ? = {0, 1}. So each letter of B is a set of possible letters 
of A. With this interpretation, we view a word on B as a set of words on A. For instance, 

71? = {010,011,110,111}. 



We will associate to the PCA P a new PCA on the alphabet B, that we call the envelope 
probabilistic cellular automaton of P. 

Definition 4.3 The envelope probabilistic cellular automaton (EPCA) of P, is the PCA 

env(P) of alphabet B, defined on the set of cells E, with the same neighborhood V as for P, and 
a local function env(f) : B v — > M.(B) defined for each y G B v by 
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env(f)(y)(0) 
env(f){y)(l) 



min f(x)(0) 
xeA v , x€y 

min f(x)(l) 

x&A v , x&y 



enu(/)(y)(?) = 1 — min /(x)(0) — min f(x)(l). 

xeA v ,xey x£A v ,x£y 

We point out that m.m xe ^v t x£y f(x)(l) + max xe ^y x£y /(x)(0) = 1, so that the last quantity 
env(/)(y)(?) is non-negative. 

Moreover, env(P) acts like P on configurations which do not contain the letter "?". More 
precisely, 

VyeA V , env(/)(y)(O) = /(y)(0), env(/)(y)(l) = /(y)(l), env(/)(y)(?) = . (3) 
In particular, we get the following. 

Proposition 4.4 If the EPCA env(P) is ergodic then the PC A P is ergodic. 

Proof. According to ([3]), any invariant measure of P corresponds to an invariant measure of 
env(P). Therefore, if P has several invariant measures, so does env(P). Assume that P has a 
unique invariant measure [/, which is non-ergodic. Let fj,Q be such that HoP n does not converge 
to fi. Then fiQ env(P) ra does not converge either, see (§. To summarize, we have proved that 
P non-ergodic implies env(P) non-ergodic. □ 



The converse of Proposition |4.4| is not true and counter-examples will be given in Section 4.3.3 



Construction of an update function for the EPCA. Let us define the update function 

4> : B E x [0, if -> B E 

of the PCA env(P), by: 



if < r k < env(f)((y k+v ) vev )(0) 

1 if 1 - env(f)((y k+v ) veV )(l) < r k < 1 
? otherwise. 



</>{y,r)k = 

The value of <ft(y, r) k in function of r k can thus be represented as follows. 



(4) 






? 


i 


ii i 



''A- 



min f{x){Q) 

xeA v , x£(y k+v )v€V 



xeA v , xe{y k+v )v€v 



For a PCA of neighborhood V = {0, 1}, we represent below the construction of the updates of 
the EPCA when the value of the neighborhood is 0?. 
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env U A u - A U J 


el "KJj( U! )\ ! ) env UA u - A 1 ] 





? 1 





1 j 




/(01)(0) 





1 1 



^ /(od)(o) 



Let be the natural update function for the PCA P defined as in ([T]). Observe that the function 
(ft coincides with (ft on configurations which do not contain the letter "?". Furthermore, we have: 

Vr G [0, 1] E , \/x G A E , My G x £ y ==^ (ft(x,r) G (f>(y,r) . (5) 



4.3 Perfect sampling using EPCA 

We propose two perfect sampling algorithms, for a finite and for an infinite number of cells. We 
show that in both cases, the algorithm stops almost surely if and only if the EPCA is ergodic 
(Theorem 4.5 ). The ergodicity of the EPCA implies the ergodicity of the PCA but the converse 
is not true: we provide a counterexample for each case, finite and infinite (Section |4.3.3 ).We 
also give conditions of ergodicity of the EPCA (Prop. 4.6 and 4.7). 



4.3.1 Algorithms 

Finite set of cells. The idea is to consider only one trajectory of the EPCA - the one that 
starts from the initial configuration ? E (coding the set of all configurations of the PCA). The 
algorithm stops when at time 0, this trajectory hits the set A E '. 

Algorithm 3: Perfect sampling using the EPCA for a finite set of cells 

Data: The pre-computed update function (ft. A family (r^ n )^k,n)eExN °f i-i-d. r.v. with 
uniform distribution in [0, 1]. 

begin 

t = l; 
repeat 

c = l E ■ 

for j = —t to —1 do 

I c = 4>(c, (rj ) ieE ) 
end 

t = t + l 
until c G A E ; 

return c 
end 
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Infinite set of cells. Once again, we consider only one trajectory of the EPCA. 



Algorithm 4: Perfect sampling using the EPCA for an infinite set of cells 

Data: The pre-computed update function <fi. A family (^ n )(^ n )g£ X N of i.i.d. r.v. with 
uniform distribution in [0, 1]. A finite subset K of E. 

begin 

V (K) = K ■ 
t = l; 
repeat 

V- t {K) = V + V„ t+1 (K) ; 

c _ ? V_ t (J0 ; 

for j = —t to —1 do 

| c = h(c,(ri) ieV . {K) )eB v ^W 

end 

t = t + 1 
until c G „4^; 

return c 
end 



Theorem 4.5 Algorithm^ resp. stops almost surely if and only if the EPCA is ergodic. In 
that case, the output of the algorithm is distributed according to the unique invariant measure 
of the PC A. 

Proof. The argument is the same in the finite and infinite cases. We give it for the finite case. 
Assume first that Algorithm [3] stops almost surely. By construction, it implies that for all fj,Q, 
the measure env(P) n is asymptotically supported by A E . Therefore, we can strengthen the 



result in Proposition 4.4 the invariant measures of env(P) coincide with the invariant measures 
of P. In that case, env(P) is ergodic iff P is ergodic. Using ([5]), the halting of Algorithm [3] 
implies the halting of Algorithm [IJ Furthermore, if we use the same samples (rjJT n )(fc,n)e-ExN> 
Algorithms [3] and [T] will have the same output. According to Proposition |4.1[ this output is 
distributed according to the unique invariant measure of P. In particular, P is ergodic. So 
env(P) is ergodic. 

Assume now that the EPCA is ergodic. The unique invariant measure ir of env(P) has to be 
supported by A E . Also, by ergodicity, we have 5 7 e env (P) n tt. This means precisely that 
the Algorithm [3] stops a.s. □ 



4.3.2 Criteria of ergodicity for the EPCA 

Finite set of cells. In the next proposition, we give a necessary and sufficient condition for 
the EPCA to be ergodic. In particular, this condition is satisfied if the PCA has positive rates 
(see the Introduction). 

Proposition 4.6 The EPCA env(P) is ergodic if and only if env(f)(? v )(?) < 1. This condition 
can also be written as: 

min /(x)(0) + min /(x)(l) > 0. (6) 
x&A v xeA v 

Proof. If env (f)(? v )(?) = 1, then for almost any r € [0, 1] , we have 4>(? E ,r) = ? E , so that at 
each step of the algorithm, the value of c is ? E with probability 1. 

Conversely, if we assume for example that p = min^g^v /(x)(0) > 0, then for any configuration 
d G B E , the probability to have 4>(x,r) = E is greater than p\ E \, so that the algorithm stops 
almost surely, and the expectation of the running time can be roughly bounded by □ 
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Infinite set of cells. For an infinite set of cells the situation is more complex. The condition 



of Proposition 4.6 is not sufficient to ensure the ergodicity of the EPCA. A counter-example 



is given in Section 4.3.3 First, we propose a rough sufficient condition of convergence for 
Algorithm [4j 

Proposition 4.7 Let a* G (0, 1) be the critical probability of the percolation PCA with neigh- 
borhood V , see Examples\2.h\ and\2.£\ The EPCA env(P) is ergodic if 



em,(/)(? y )(?) < a* (7) 

and non- ergodic if 

min env(f)(x)(?) > a*. (8) 
x£B v -A v 

Proof. Recall that B = {0,1,?}. Define C = {d,?}, with d = {0,1}. A word over C is 
interpreted as a set of words over B, for instance, d? = {0?, 1?}. The symbol d stands for 
determined letter, as opposed to ? which represents an unknown letter. 

We define a new PCA Q on the alphabet C, with the same neighborhood V as P and env(P), 
and with the transition function g : C v — > Ai(C) defined by: 

g(d ) = 5d, and Vu G C v — {d^}, g(u) = a5? + (1 — a)5d , 

for a = max IgB v env(/)(x)(?) = env(/)(? y )(?). 

Observe that 5 a e is an invariant measure of Q. Recall that is an update function of env(P), 
see Q. Given the way Q is defined, we can construct an update function 4>q of Q such that 

Vx G B E , My G C E , Mr G [0, 1] E , x G y => $(x, r) G <j> Q (y, r) . (9) 

In particular, assume that Q is ergodic. Then 5^e Q n — ^> $d E - Using (|9j), it implies that 
Algorithm|4]stops almost surely, and env(P) is ergodic according to Theorem 4.5 To summarize, 
the ergodicity of Q implies the ergodicity of env(P). 



Observe that the PCA Q is a percolation PCA as defined in Example 2.5 (here, d plays the 
role of and ? plays the role of 1). Let a* G (0, 1) be the critical probability of the percolation 
PCA with neighborhood V, see Example |2.8[ For a < a* , the percolation PCA Q is ergodic. 
This completes the proof of ([7]). 

Define a PCA R on the alphabet C, with neighborhood V, and with the transition function: 

h(d v ) = 6 d , and Vu G C v — {d v }, h(u) = /3<5? + (1 - (3)5 d , 

for /3 = mm xel3 v _^v env(/)(x)(?). Given the way R is defined, we can construct an update 
function (j)R of R such that 

VxeB E ,Vy€C E ,Vre[0,l} E ,VkeE, [x G y, <f> R {y, r) k = ?] =>fa,r) k = l. 

Therefore, the ergodicity of env(P) implies the ergodicity of R. Equivalently, the non-ergodicity 
of R implies the non-ergodicity of env(P). Observe that the PCA R is a percolation PCA. 
Therefore, for /3 > a*, the percolation PCA R is non-ergodic. This completes the proof of Q. 

□ 
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4.3.3 Counter-examples 



Recall Proposition 4.4 [ EPCA ergodic 
is not true. 



PCA ergodic ]. We now show that the converse 



Example 4.8 Consider the PCA with alphabet A = {0, 1}, neighborhood V = {— 1, 0, 1}, set 
of cells E = Z/nZ, and transition function 



f(x,y,z) 



5i- y if xyz G {101, 010} 

aS y + (1 — a)8\- y otherwise , 



for a parameter a G (0,1). This is the PCA Majority studied in Section [5] For n odd, we 
prove in Proposition |5.1| that the PCA is ergodic. However the associated EPCA satisfies 



env(/)(???) = <5?. According to Proposition 4.6, the EPCA is not ergodic. 



Example 4.9 Consider the PCA of Example 2.4 This PCA has positive rates, in particular, 
it satisfies So the EPCA is ergodic on a finite set of cells. Now let the set of cells be Z. The 
PCA is ergodic for e G (0, 1), see Example 2.7 Consider the associated EPCA env(P). Assume 
for instance that e G (0, 1/2). We have 



env(/)(it) 



/(«) ifuG{0,l} y 
£<5o + s6i + (1 — 2e)(5? otherwise . 



By applying Proposition 4.7, env(P) is non-ergodic if 1 — 2s > a* . 



4.4 Decay of correlations 

In what follows, the set of cells is E = Z d , d > 1. It is easy to prove that the invariant measure 
of an ergodic PCA is shift-invariant. Using the coupling from the past tool, we give conditions 
for the invariant measure of an ergodic PCA to be shift-mixing. 

Definition 4.10 A measure \i on X = A^ d is shift-mixing if for any non-trivial translation 
shift t oflr, and for any cylinders U,V of X , 



lim n(U n T~ n (V)) = (x(U)n(V). 



(10) 



The proof of the following proposition is inspired from the proof of the validity of the coupling 
from the past method (see [23] or |16j). 



r"»(£) 



V-t,(K) 



Figure 4: Illustration of the proof of Proposition 



4.11 



Proposition 4.11 If Algorithm^ stops almost surely, then the unique invariant measure of 
the PCA is shift-mixing. It is in particular the case under condition |?|). 
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Proof. Assume that P is an ergodic PCA, and denote by ir its unique invariant measure. Let 
K and L be two finite subsets of E, and denote by xk and yi some cylinders corresponding to 
these subsets. Since the perfect sampling algorithm stops almost surely, for each e > 0, there 
exists an integer t £ such that with probability greater than 1 — e, the algorithm stops before 
reaching the time — t e when it is run for the set of cells K or for the set of cells L. If n G N d is 
large enough, we have: V-t e (K) n V-t E (r~ n (L)) = 0- 

Let Z be the output of the algorithm if it is asked to sample the marginals of n corresponding 
to the cells of K U r" n (L). 

Imagine running the PCA from time — 1 £ and set of cells V-t £ {K) U V-t £ ( T ~ n (L)) up to time 
0, using the same update variables as the ones used to get Z. Choose the initial condition at 
time — t £ as follows: independently on V-t e (K) and V-t e (T~ n (L)), and according to the relevant 
marginals of tt. Let X, resp. Y, be the output at time on the set of cells K, resp. r~ n (L). 
Observe that X and Y are distributed according to the marginals of tt. Furthermore, X and 
Y are independent since the dependence cones of K and T~ n (L) originating at time — t £ are 
disjoint. We therefore get 

tt(xk n r- n {y L )) - it(x K )it{y L ) = F{Z K = x K , Z T - n{L) = y L ) - F(X = x K )F(Y = y L ) 

= F(Z K = x K , Z T - n{L) = y L ) - P(X = x K ,Y = y L ) 
< F((Z K ,Z T -n( L) ) = (xK-, Vl) and (X,Y) / (x K ,y L )) 
<F((Z K ,Z T - n{L) )^(X,Y)) < 2s. 

In the same way, we get Tr(xK)^(yL) — k{xk H T~ n (yL)) < 2e. It completes the proof. □ 



In Proposition 4.11 the coupling from the past method is not used as a sampling tool but as 
a way to get theoretical results. Knowing if there exists an ergodic PCA having an invariant 
measure which is not shift-mixing is an open question (see [6] for details). 

4.5 Extensions 

In a PCA, the dynamic is homogeneous in space. It is possible to get rid of this characteristic 
by defining non-homogeneous PCA, for which the neighborhood and the transition function 



depend on the position of the cell. The definition below is to be compared with Definition 2.1 
The configuration space X = A E is unchanged. 

Definition 4.12 For each k £ E, denote by Vk C E the (finite) neighborhood of the cell k, and 
by fk '■ -A Vk — > A4(A) the transition function associated to k. Set V{K) = Ufcg^Vfc. The non- 
homogeneous PCA (NH-PCA) of transition functions (fk)keE is the application P : M.(X) — > 
M(X), fj, i y fiP, defined on cylinders by 

^P{vk) = K X V(K)) II fk((x v )v&v k )(yk) ■ 

xv{K)£C{V{K)) k&K 

Observe that it is not necessary for E to be equipped with a semigroup structure anymore. We 
use this below to define the finite restriction of a PCA. 

It is quite straightforward to adapt the coupling from the past algorithms to NH-PCA. More 



precisely, given a NH-PCA, we define the associated NH-EPCA by considering Definition 4.3 



and replacing V and env(/) by Vk and env(/)fc for each k G E. The algorithms of Section 



4.1 and 4.3.1 are then unchanged, and Proposition 4.4 and Theorem 4.5 still hold in the non- 



homogeneous setting. 
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In Section [5j we study the PCA Majority by approximating it by a sequence of NH-PCA. Let 
us explain the construction in a general setting. 

Let P be a PCA on the infinite set of cells E, with neighborhood V and transition function 
/ : A v -4 M{A). Let D be a finite subset of E. Define 

V{D) = {u G E | 3x G D, 3v G V U {0}, u = x + v}, B(D) = V{D) - D . 

The set B(D) is the boundary of the domain D. Fix a probability measure v on „4. The 
restriction of P associated with v and -D is the NH-PCA P{y, D) with set of cells V(D) and 
neighborhoods: 

Vu £D, K = M + V, V«G 5(D), K = ; 
and transition functions: 

Vu ED, f u = /, Vu G £(£>), /„(•) = i/ . 

In words, the boundary cells are i.i.d. of law v and the cells of D are updated according to P. 

If u is a probability measure on A s , where S is a finite subset of E, we define its extension fx 
on A E by setting, for a fixed letter a £ A: 

if Vi G D — 5 1 , Xi = a 
otherwise. 



Vx G .4^, 




Lemma 4.13 Let (-Di)ieN &e an increasing sequence of finite domains D{ C E such that 
UjgN-Di = E. Let (i^)ieN a sequence of probability measures on A. For each i, let Uj be 
an invariant measure of P(vi, Di) . Any accumulation point of the sequence (fii)ieH ^ s an in- 
variant measure of the original PCA P defined on E. 

Proof. Upon extracting a subsequence, we may assume that (/xj)j g N converges to jX G Ai(X). 
We need to prove that for any cylinder yx G C(K), we have fiP{yK) = ^{vk)- 
By definition, fj,jP(fj, Dj) = fij. Let the subset K of E and the cylinder yx G C(K) be fixed. 
If j is large enough, we have K C Dj and V{K) C Dj. So that ^(tjk) = fij(,UK) and P(uj,Dj) 
and P coincide on K. We deduce that fijP{yK) = fijiVK:)- By taking the limit on both sides, 
we get flP(yK) = Hvk)- □ 

Alphabet with more than two elements 

The EPCA and the associated algorithms have been defined on a two letters alphabet. It is 
possible to extend the approach to a general finite alphabet. 

Let A be the finite alphabet. Let P be a PCA with set of cells E, neighborhood V, and 
transition function / : A —> A4(A). 

Consider the alphabet B = 2^ — {0}, that is, the set of non-empty subsets of A. A word over 
B is viewed as a set of words over A. 

The EPCA env(P) associated with P is the PCA on the alphabet B with neighborhood V and 
transition function env(/) defined by: 

Vv G B v , Vy G B, env(/)0)(y) = V (-l)lvl-M mm f(u)(x) . 

For instance, we have: env(/)(v)({0, 1,2}) = ao,i,2 — "0,1 — &i,2 ~ Q o,2 + «o + «i + «2 with 
a 5 = mm uev f(u)({S}). 



The algorithms of Section 4.3 are unchanged. Observe however that the construction of an 



update function is not as natural as in the two- letters alphabet case. 
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5 The majority PCA: a case study 



The Majority PCA is one of the simplest examples of PCA whose behaviour is not well under- 
stood. Therefore, it provides a good case study for the sampling algorithms of Section |4j 

5.1 Definition of the majority PCA 

Given < a < 1, the PCA Majority (a), or simply Majority, is the PCA on the alphabet 
A = {0,1}, with set of cells E = Z (or Z/nZ), neighborhood V = {—1,0,1}, and transition 
function 

f{x, y,z) = a <Wj (w) + (1 - a) Si- y , 

where maj : A ?J — > _A is the majority function: the value of maj(x, y, z) is 0, resp. 1, if there are 
two or three 0's, resp l's, in the sequence x,y,z. The transition function of PCA Majority(a) 
can thus be represented as in Figure [5} It consists in choosing independently for each cell to 
apply the elementary rule 232 (with probability a) or to flip the value of the cell. 

The PCA Minority (a) has also been studied (see |26j). It is defined by the transition function 
g{x, y, z) = f(l - x, 1 - y, 1 - z). 



1 proba 1 — a 
proba a 




or 1 
or 1 



1 proba a 
proba 1 — a 




proba 1 



1 



1 proba 1 



1 1 



Figure 5: The transition function of the PCA Majority 



Let x = (01) z G {0, 1} Z be defined by: Vn £ Z, x 2n = 0, x 2n +i = 1. The configuration (10) z is 
defined similarly. Consider the probability measure 

fi = (5(oi)z + <5(io) z )/ 2 • (11) 

Clearly [i is an invariant measure for the PCA Majority. The question is whether there exists 
other invariant measures. 

To get some other insight on the question of invariant measures, consider the PCA Majority on 
the set of cells Z n = Z/raZ. This PCA has two completely different behaviors depending on the 
parity of n. 

Proposition 5.1 Consider the Markov chain on the state space {0, l} Zn which is induced by 
the Majority PCA. The Markov chain has a unique invariant measure v. If n is even then 
v = (5( 01 W2 + 5( 10 w2)/2; if n is odd then v is supported by {0, 1} Z ' 1 . 

Proof. Let Q be the transition matrix of the Markov chain. We are going to prove the following. 
If n is odd, then Q is irreducible and aperiodic. If n is even, the graph of Q has a unique terminal 
component consisting of the two points (01) n / 2 = 0101 ... 01 and (10) n / 2 = 1010 ... 10. 

Assume first that n is odd. From the configurations n = ... or l n = 1 ... 1, we can go to 
any configuration in one step. So, we only need to prove that from a given configuration, say 
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c £ {0, l} Zn , it is possible to reach n or l n . We assume that c is neither n nor 1™, and we 
consider the length L of the longest subsequence of identical bits in c. Since n is odd, there 
are at least two consecutive 0's or two consecutive l's. So, we have L > 2. Say that we have 
the pattern 10 L 1 in c. With probability (1 — a) 2 a L , the update of the corresponding cells will 
be: flip, majority, majority, majority, flip. Thus after one step, we can obtain configurations 
with the pattern L+2 (or n if L = n — 1). By iterating, we see that after at most n/2 steps, 
we can reach configuration n or l n . 

Assume now that n is even. Clearly 

°(oi) n / 2 Q = ^(10)™/ 2 ' ^(io) n / 2< 5 = "(oi) n / 2 • 

Thus, {(01)"/ 2 , (10)"/ 2 } is a terminal component of the graph of Q. Consider c £ {0, \] £n — 
{(01)™/ 2 , (10) n / 2 }. Then c has at least two consecutive 0's or l's. We can use the same argument 
as for n odd, to show that we can reach n or l n in at most n/2 steps. We conclude by observing 
that (01) n / 2 can be reached from n or 1" in one step. □ 



Let us come back to the PCA Majority on Z. The invariant measure u in (11) can be viewed 
as the "limit" over n of the invariant measures of the PCA on TLm- What about the "limits" of 
the invariant measures of the PCA on Z2„+i ? Do they define other invariant measures for the 
PCA onZ? 

One of the motivations of our work on perfect sampling algorithms for PCA was to test the 
following conjecture, which is inspired by the observations made in |25j and |26] on a PCA equi- 
valent to Majority. This conjecture concerns the existence of a "phase transition" phenomenon 
for the PCA Majority. 

Conjecture 5.2 There exists a c £ (0, 1) such that Majority(a) has a unique invariant measure 
if a < a c , and several invariant measures if a > a c . 

In the next subsection, we give some rigorous (but partial) results about the invariant measures 
of Majority(o). We first introduce a related PCA and use it to prove that if a is large enough, 
Majority(a) has indeed non-trivial invariant measures; we then present a dual model that could 
be used to have some information for small values of a. 

The last subsection is devoted to the experimental study of Majority(a) using the perfect 
sampling tools developed in the previous section. 

5.2 Theoretical study 

5.2.1 A related model: the "flip-if-not-all-equai" PCA 

Let us define as in [25], the PCA FINAE(a) of neighborhood V = {—1,0,1} and transition 
function g : {0, 1} V -)■ A^({0, 1}) given by 

g(x,y,z) = aSflip-if-not-all-equalC^,*) + t 1 ~ a ) 5 v > 

where the function flip-if- not-all-equal (FINAE), corresponding to the elementary cellular au- 
tomaton 178, is defined by 



flip-if-not-all-equal(x, y, z) - 
Clearly, 5 z and 5^ are invariant measures of the PCA. 



y if x = y = z 
1 — y otherwise. 
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Figure 6: The graph G 



Let us define flip-odd : {0, 1} Z -> {0, 1} Z and flip-even : {0, 1} Z -)■ {0, 1} Z by, for x = (xi) ie z, 

)xi if i is even J 1 - Xi if i is even 

flip-odd(x); = < flip-even(x)j = < . 

I 1 — X{ if i is odd I Xi if z is odd 

If we extend flip-odd and flip-even to mappings on A4({0, 1} Z ), we have 

Majority(a) = flip-odd o FINAE(a) o flip-even . 

This equality can be checked on the local functions of the PCA Majority(a) and FINAE(a). 
One thus obtains that if it is an invariant measure for FINAE(a), then 

(flip-odd(vr) + flip-even(vr))/2 

is an invariant measure for Majority (a). The invariant measures 5 z and <5jz of FINAE(a) 



correspond to the invariant measure /i in ( 11 ) for Majority(a), and the existence of a non-trivial 
invariant measure for FINAE(a) corresponds to the existence of a second invariant measure for 
Majority (a). 

5.2.2 Validity of the conjecture for large values of a 



The partial result of Proposition 5.3 relies on ideas from Regnault |25j . 



Proposition 5.3 Let p c be the percolation threshold of directed bond-percolation in N 2 . If 
a > ^/l — (1 — Pc) 4 , then Majority(a) has several invariant measures (resp. FINAE(a) has 
other invariant measures than the combinations of <5 z and <5 1 z ). It is in particular the case if 
a > 0.996. 

Proof. It is known that 0.6298 < p c < 2/3, see for instance Grimmett [J3]. This provides the 
bound {/l - (1 -p c ) 4 < 0.996. 

Let us consider the directed graph G = (N, A) such that the set of nodes is = 2Z x 2NU (2Z + 
1) x (2N + 1) and for each G A, there is an arc (oriented bond) from to (i — 1, j + 1) 
and one from to (i + + 1). 

Let S be some subset of 2Z x {0} called the source. The oriented bond-percolation on G of 
parameter p and source S is defined as follows: each node is open with probability p and closed 
with probability 1 — p, independently of the others, and a node of N is said to be wet if there 
is an open path joining it from some node of S. 

We say that the space-time diagram (xfykezjeN of FINAE(a) and the percolation model satisfy 
the correspondence criterion at time t if for each wet cell (k, t) of height t, we have x\ ^ xt k+i 
or xi ^ x\_ x . 
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For values of (a,p) satisfying a > y/l — (1 — p) 4 , Regnault is able to construct a coupling 
between FINAE(a) and the percolation model such that if the correspondence criterion is true 
at time t, it is still true at time t + 1. Let us take for the initial configuration of FINAE(a) 
the configuration x° defined by x° k = 1 if k is odd and x® = if n is even. We also choose 
S = 2Z x {0} for the percolation model. The correspondence criterion is true at time 0. By the 
coupling described in [25], the criterion is true at all time. 

Consider the percolation model and the probability P((0,2t) is wet). It is known (see for 
example [14J) that if p is strictly greater than a certain critical value p c , this probabil- 
ity, which decreases with t, does not tend to 0. Thus, for p > p c , there exists rj p > 
such that P((0, 2t) is wet) > r/ p for all t £ N. By construction of the coupling, we obtain 
P(xq* / xf or Xg* / z^i) > rjp for all t £ N. This proves that for a > y/l — (1 — p c ) 4 , the 
PCA FINAE(a) has at least one invariant measure which is not in the convex hull of the Dirac 
masses at the configurations "all zeroes" and "all ones" (take any accumulation point of the 
Cesaro sums obtained from the sequence obtained from the iterated of 5 x o by FINAE). This 
result can be translated to the PCA Majority. □ 



5.2.3 A duality result with the double branching annihilating random walk 

The aim of this subsection is to prove a duality result between FINAE(a) and a double branching 
annihilating random walk (DBARW). The connection between these two models is interesting 
in itself and could provide a new way to study the PCA Majority(a) for small values of a. A 
similar duality result was already obtained for interacting particle systems (see [7]), and the 
behavior of the DBARW is very well understood in continuous time (see [2Z]); but its study 
appears to be more difficult in discrete time. 



We now assume that a < 2/3 (in particular, Proposition 5.3 does not apply). 

Let us define a process (a4)feez,teN in the following way. For each (k,t) 6 Z x N, we first choose 
independently to do one (and only one) of the following: 

1. with probability a/2, draw an arc from {k — 1, t) to (k, t + 1), 

2. with probability a/2, draw an arc from {k + 1, t) to (k, t + 1), 

3. with probability a/2, draw an arc from (k—l,t) to (k,t-\-l), an arc from (k, t) to (k,t + l), 
and an arc from (k + l,t) to (k, t + 1), 

4. with probability 1 — 3a/2, draw an arc from (k,t) to (k,t + 1). 




We thus obtain a directed graph G\, that we will use to label each node of Z x N with a letter 
of {0, 1}. The nodes of Z x {0} are labeled according to the wanted initial configuration x°. A 
node labeled by a 1 will be interpreted as being occupied. A node (k, t) 6 Z x N is then labeled 
by a 1 if and only if there is an odd number of paths leading to this node from an occupied 
node of Z x {0}. This define a random field (x^)fcez,teN representing the labels of the nodes. 



22 



We claim that this field has the same distribution as a space-time diagram of FINAE(a) starting 
from x°. Indeed, the value x l k +1 is equal to x\_ 1 with probability a/2, to x k+l with probability 
a/2, to x\_ x + x k + x k+1 mod 2 with probability a/2 and to x k with probability 1 — 3a/2. And 
one can check for each value (x, y, z) 6 {0, l} 3 that these probabilities coincide with the ones 
obtained with the local function flip-if-not-all-equal. For example, if (a?jj._]_, x\, x k+i) = (0> 0' 1): 
the value of x k +l will be 1 if and only if case 2 or case 3 occurs, and they have together 
a probability a/2 + a/2 = a. If (x t k _ 1 ,x t k ,x t k+1 ) = (0,1,0), we will have x l k +1 = 1 if and 
only if case 3 or case 4 occurs, which has a probability a/2 + (1 — 3a/2) = 1 — a. And if 
( x k-ii x \-> x k+i) = (0)0,0) (resp. (1, 1, 1)), we will get a (resp. a 1) in all cases. 

We now consider the process (y^,)fcez,teN obtained from (a^fceZ.teN by reversing time. Formally, 
for each (ft, t) £ Z x N, we first choose independently to do one (and only one) of the following 
things: 

1. with probability a/2, draw an arc from (ft, i) to (ft — 1, t + 1); 

2. with probability a/2, draw an arc from (ft, t) to (ft + 1, t + 1); 

3. with probability a/2, draw an arc from (k,t) to (ft — l,t+l), an arc from (ft,t) to (ft,t + l), 
and an arc from (ft, t) to (A; + 1, t + 1); 

4. with probability 1 — 3a/2, draw an arc from (k,t) to (ft, f + 1). 




We thus obtain again a directed graph G2, that we will use to label each node of Z x N with a 
letter of {0, 1}. The nodes of Z x {0} are labeled according to the wanted initial configuration 
y°. A node labeled by a 1 will be interpreted as being occupied. A node (k,t) G Z x N is 
then labeled by a 1 if and only if there is an odd number of paths leading to this node from an 
occupied node of Z x {0}. This define a random field (y£)fcez,t6N representing the labels of the 
nodes. 

We claim that this field has the same distribution as the double branching annihilating random 
walk c that we now define. 

At time 0, a particles is placed on each cell k of Z such that y k = 1, and at each step of time, 
every particle chooses independently of the others do one (and only one) of the following things: 

1. with probability a/2, move from node k to k — 1; 

2. with probability a/2, move from node k to fc+ 1; 

3. with probability a/2, stay at node k and create two new particles at nodes ft — 1 and fc + 1; 

4. with probability 1 — 3a/2, stay at node k. 

If after these choices, there is an even number of particles at a node, then all these particles 
annihilate. If there is an odd number of them, only one particle survives. We set wt = 1 if and 
only if at time t, there is a particle at node k. 
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To summarize, we have the following relations: 

FINAE ~ (4 ) k ^ tm time ^ ersal (y*) feeZit6N ~ DBARW 



The processes (xfykezjeN and (ul)kez,teN are obtained one from another by reversing time. 
This can be used to get nontrivial information for FINAE. For instance, if A represents the set 
of occupied nodes at time for x, that is to say x° = 1a, we have the following duality relation: 

¥- =^(x{^x t l )=F-°= 1 Hx t k + x t l = l) 

= P(the total number of paths in G\ leading from A x {0} to (k, t) or (I, t) is odd) 
= P(the total number of paths in G2 leading from (k, 0) or (I, 0) to A x {t} is odd) 

= P y° =1 {k,i} (J2 y\ is odd) 

ieA 

<ps/ =i {M }(3i G A,yj = 1). 

Thus, to prove that the probability for the PC A FINAE that two cells k and I will be in different 
states at time t tends to as t tends to +00, it is sufficient to prove that in the DBARW, starting 
from two particles, the probability of extinction of the population of particles tends to 1. 

5.3 Experimental study 



We tried to get some numerical evidence for Conjecture 5.2 To study the PCA Majority 
experimentally, a first idea would be to consider the same PCA on the set of cells Z n , n 
odd. This does not work well. First, due to the state space explosion, computing exactly the 
invariant measure is possible only for small values (we did it up to n = 9 using Maple). Second, 
the algorithms of Section [4] cannot be applied since the EPCA is not ergodic. 

Instead, we use approximations of the PCA by NH-PCA on a finite subset of cells, the method- 



ology sketched in Section 4.5 Again, computing exactly the invariant measure is impossible 



except for very small windows. But now the sampling algorithms become effective. 

Let P be the PCA Majority. Set D n = {— n, . . . , n}, and let v be the uniform measure on {0, 1}. 
Consider the NH-PCA P(v,D n ). Let \x n be the unique invariant measure of P(v,D n ). We are 
interested in the quantity 

C n = ^n{x G X I XQ = X\ = 0} + fl n {x G X \ Xq = X\ = 1} . 



Indeed, by application of Lemma 4.13 , if lim sup n c n > 0, then there exists a non-trivial invariant 
measure for the PCA Majority on Z. 

Now the NH-EPCA is ergodic, so the sampling algorithms of Section [4] can be used. We were 
able to run the algorithms up to a window size of n = 1024 before running into overtime 
problems. The experimental results appear in Figure [9j with a logarithmic scale. We ran the 
sampling algorithms 10000 times. We show on the figure the confidence intervals calculated 
with Wilson score test at 95%. 

It is reasonable to believe that the top two curves in Figure [9] do not converge to while 
the bottom three converge to 0. This is consistent with the visual impression of space-time 



diagrams. It reinforces Conjecture 5.2 with a possible phase transition between 0.4 and 0.45. 



Acknowledgements. We used the applet FiatLux developed by N. Fates and available on 
his website (LORIA, INRIA Lorraine) to draw the space-time diagrams of Figures [l]and[9} 
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(c) a = 0.3 



Figure 9: Experimental study of Majority(a) (the configurations at odd times only are repre- 
sented on the space-time diagrams). 
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